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Description 

Method and device for speech processing 



The development of workaday speech recognition systems 
and speech control systems has for years been one of 
the main lines of development of computer technology. 
In the course of this development, substantial advances 
have been achieved and marketable speech recognition 
systems have been established which are also proving 
themselves in practical use. Advanced systems of this 
type are also fundamentally suited for speech control 
of a computer and/or of connected peripherals. Simple 
speech recognition systems, which can, however, process 
only a relatively small vocabulary, are also already in 
use in the sectors of consumer electronics and motor 
vehicle equipment, as well as further sectors in which 
acoustic control of equipment on the basis of a limited 
vocabulary is possible and sensible. 

As a rule, in the case of speech recognition systems 
there are tools which can be used to input the 
vocabulary to be recognized by the speech recognition 
system. As a rule, the words or utterances are input in 
orthographic notation via an appropriate interface 
software of the computer program and are automatically 
converted into the internal notation of the speech 
recognition system (mostly a variant of phonetic 
transcription (phonetic script) ) . In this conversion 
process, which is automatic or supported by lexicon 
look-up, errors can occur in the phonetic transcription 
which arise from inadequate conversion rules and/or 
incomplete lexica. Since the speech recognition system 
builds up its recognition process on the basis of the 
phonetic transcription thus generated, an incorrect 
phonetic transcription also produces errors in the 
speech recognition. 



In order to ensure optimum performance, it must be 
ensured that the phonetic transcription is as correct 
as possible. 



5 The problem has so far been solved in that the user has 
been able to check manually the phonetic transcription 
generated by the system after inputting of the 
orthographic (correct) notation. However, this is 
difficult, as a rule, for untrained staff. 
10 Consequently, use has been made of various aids on 
offer in SW on the market : 

1. The user can have displayed for himself words which 
are typical of the various phonetic symbols and in 

15 which such symbols are contained, and can correct the 
Q phonetic notation manually. In this case, he is further 

2 supported in a few systems to the effect that no 

S incorrect character sequences of the phonetic 

]i5 transcription can be used, since the software employed 

W 2 0 can input only those character strings which represent 

I ^ a valid ASCII sequence for the phonetic character set 

m used. 

2. The phonetic transcription is converted again into 
25 an audible speech from the phonetic notation with the 

aid of text-to-speech software systems, that is to say 
speech synthesizing methods. This serves the purpose of 
the acoustic plausibility check of the phoneme string 
which has been automatically generated by the system 
3 0 for a word. This audible test can, however, eliminate 
only drastic errors and is subject to the shortcomings 
of the acoustic channel . Moreover, it is necessary to 
ensure correspondence between the phonetic alphabets 
used in the speech recognition as also in the speech 

3 5 synthesis, and this is so in very few cases. 

The invention is therefore based on the object of 
specifying an improved method and a device for speech 
processing which are distinguished, in particular, by a 

4 0 substantially improved user- friendliness and, in 
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conjunction therewith, also by enhanced accuracy and 
reliability. 

This object is achieved with regard to the aspect of 
its method by a method having the features of claim 1, 
and with regard to the aspect of its device by a device 
having the features of claim 6. 

The invention includes the essential idea of replacing 
the outputting of a word converted into phonetic 
transcription, something which is unfamiliar to, and 
can be handled only with difficulty by the 
linguistically untrained user, in this phonetic 
transcription (phonetic script) by an outputting which 
is simple and can be handled more reliably. It further 
includes the idea of selecting for this purpose an 
output form which is to be denoted as "pseudo- 
orthographic" and does not demand of the user knowledge 
of special characters of the phonetic transcription and 
of their special rules. Put simply, the outputting of 
the converted words is performed "in the way they are 
spoken" . 

This pseudo-orthographic outputting, which is easy to 
understand even for the layman and can be effectively 
handled, of a language converted into phonetic 
transcription requires an additional step in the speech 
processing method, specifically the step of conversion 
from the phonetic transcription into this pseudo- 
orthographic representation. This additional step 
includes a method in the case of which the phonetic 
units of the words are converted, in a self -learning 
fashion or with access to a predetermined set of rules, 
into simple graphemic units of written script. This 
conversion is performed in a simple and expedient 
embodiment by accessing a stored phoneme /grapheme 
assignment table which is initialized at least with an 
initial stock of assignment rules and can, if 
appropriate, be extended by the user in the course of a 



self -learning process during the application of the 
system on the basis of additional inputs. 

In a particularly convenient design which is 
advantageous for the purpose of the self -learning 
process mentioned, the method also comprises a further 
conversion step of reverse conversion into the phonetic 
transcription from a pseudo-orthographic representation 
(employed by the user when inputting for the purpose of 
correcting the primary conversion result) . The tabular 
assignment mentioned can also be used in this step and, 
if appropriate, can be supplemented and refined in the 
course of a self -learning process. 

In accordance with the method features specified above, 
in addition to a first converter unit known per se for 
converting an orthographic input into the phonetic 
transcription, a device for carrying out the proposed 
method has a second converter unit for converting from 
the phonetic transcription into the pseudo-orthographic 
representation mentioned, and an output unit for 
outputting in this form of representation. 

The device has an appropriate third converter unit for 
the abovementioned development of the method, which 
permits the user to make a correcting input by using 
the pseudo- orthographic representation. 

In order to apply the phoneme/grapheme assignment table 
mentioned, in a preferred embodiment the device has an 
appropriate memory in which this assignment table is 
held accessibly for the second and/or third converter 
unit . 

Advantages and expedient features of the invention 
emerge for the rest from the subclaims and the 
following description of a preferred exemplary 
embodiment with the aid of the figure. 



The figure shows a schematic illustration of a speech 
processing device 1 for carrying out the method 
according to the invention in an embodiment in the form 
of a functional block diagram. The speech processing 
device 1 comprises an acoustic input unit 3 at whose 
output a preprocessed stream of speech SI is present 
which is fed to an input of a speech recognition unit 5 
which outputs a written text S2 . The speech recognition 
unit 5 comprises a vocabulary memory 5a in which the 
vocabulary of the speech recognition unit is stored in 
the phonetic notation customary in conventional speech 
recognition systems. 

The vocabulary memory 5a is continuously updated by the 
input of additional terms by means of an alphanumeric 
input unit 7, which terms are converted from the 
orthographic input format in a first converter unit 9 
into the phonetic transcription (phonetic script) . A 
lexicon memoir/ 11 supports the conversion procedure in 
the first converter unit 9, For the purpose of checking 
and correcting undertaken inputs, a second converter 
unit 13 is provided for converting the phonetic 
transcription into a pseudo- orthographic 

representation. This is indicated on a display screen 
15 for the user. 

Also provided is a third converter unit 17 for 
converting pseudo-orthographic inputs via the 
alphanumeric input unit 7 into phonetic notation, the 
output of which is connected to the vocabulary memory 
5a of the speech recognition unit 5. The second and 
third converter units 13, 17 are assigned an assignment 
memory 19, organized in the form of a look-up table, 
for predetermined phoneme /grapheme assignments. 

An input, performed by the user, of a new term in 
correct orthographic notation is converted in the first 
converter unit 9 into phonetic script and can - 
depending on the actual organization of the system - 
already be fed in this form to the vocabulary memory 



5a. In each case, the word converted into phonetic 
script is fed, however, to the second converter unit 
13, where a further conversion into a pseudo- 
orthographic representation is performed, which is 
displayed on the display screen 15 and causes the user, 
if appropriate via the input unit 7 - now in the 
pseudo-orthographic representation, which also appears 
on the display screen - to make a correcting input, or 
else to confirm the displayed pseudo- orthographic 
representation. The pseudo-orthographic input is 
converted in the third converter unit 17 into phonetic 
script and now (for the first time or, if the word has 
already been taken over into the vocabulary memory 5a 
on the occasion of the first input, in a correction 
mode) fed to the vocabulary memory 5a. The contents 
thereof are thereby expanded by a word checked with 
regard to the phonetic notation. 

The procedure described above is explained below using 
two examples : 

1st example 

"Jacques Chirac" is input in correct orthographic 
notation via the alphanumeric input unit 7. The 
phonetic notation "sh a xk sh i : rr a xk" is formed 
therefrom in the first converter unit 9. The second 
converter unit 13 forms "sch a k sch i r a k" 
therefrom, and the input name is displayed on the 
display screen 15 in this notation. It is possible - 
without knowing the phonetic alphabet used in the first 
conversion - to perceive from this representation that 
the phonetic notation generated by the system is 
adequate. The user can confirm the conversion result, 
and the newly input name passes (in phonetic notation) 
into the vocabulary memory 5a. 



2nd example 



"Professional Service" is input via the input unit 7. 
The first converter unit 9 generates therefrom in 
phonetic notation 

"pro: f ae sh o n :e: 11 s oe r v i : cc :e". In the 
result of the further conversion in the second 
converter unit 13, ^' Prof aschonell Sorwieke" is yielded 
therefrom in pseudo-orthographic notation, and this 
representation is again displayed on the display screen 
15 . 

The user perceives straight away that the phonetic 
script generated by the system cannot be correct, since 
it does not correspond to the usual pronunciation of 
the input word combination. The user will now use the 
input unit in conjunction with the pseudo-orthographic 
notation, which is illustrated on the screen, to 
undertake a correction, and the correction result is 
converted again in the third converter unit 17 from the 
pseudo- orthographic notation into the phonetic one, and 
taken over in this form into the vocabulary memory 5a. 
In the example given, the user will therefore input 
"Prof aschonnell Sorwis", and the new word combination 
(in phonetic notation) is anchored in the vocabulary 
memory . 

It is to be seen that the specified method can also be 
carried out in a plurality of steps when, after a first 
correction by the user, a further conversion from the 
phonetic notation into the pseudo -orthographic one is 
performed in conjunction with a further display in this 
representation such that, if appropriate, system errors 
can be eliminated iteratively. In this case, it is 
preferred to apply a self -learning system - known per 
se - for example in the form of a neural network with 
the aid of which a self -adaptation of the memory 
contents of the assignment memory 19 and/or the 
assignment rules of the first conversion operation 
(orthographic - phonetic) can be performed. 



The design of the invention is not limited to the 
example described above, but is also possible in a 
multiplicity of modifications which are within the 
scope of expert activity. 
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Pat ent c 1 aims 



1. A method for speech processing, in which an 
orthographic input is converted into a phonetic 
transcription in a first conversion step, and a step of 
checking and correcting the conversion result by the 
user is provided, characterized by a second step of 
converting from the phonetic transcription into a 
pseudo-orthographic representation and outputting in 
this representation . 

2- The method as claimed in claim 1, characterized by a 
third step of converting an input performed in the 
pseudo- orthographic representation into the phonetic 
transcription . 

3. The method as claimed in claim 1 or 2, characterized 
in that the second and/or third conversion step 
comprises a conversion of phonetic word units into 
simple graphemic script units, or vice versa. 

4. The method as claimed in claim 3, characterized in 
that the second and/or third conversion step is 
executed by accessing a stored phoneme /grapheme 
assignment table (19) . 

5. The method as claimed in claim 3 or 4, characterized 
in that the second and/or third conversion step is 
executed by means of a self -learning method, in 
particular by using a neural network for continuous 
updating of the phoneme /grapheme assignment table (19) . 

6. A device (1) for carrying out the method as claimed 
in one of the preceding claims, having an alphanumeric 
input unit (7) and a first converter unit (9) , 
connected to the latter on the input side, for 
converting an orthographic input into a phonetic 
transcription, and a display unit (15) for optically 
displaying an input word, characterized by a second 
converter unit (13) for converting from the phonetic 



transcription into a pseudo- orthographic 

representation, which is connected on the output side 
to the display unit. 

7. The device as claimed in claim 6, characterized by a 
third converter unit (17) for converting an input 
performed in the pseudo-orthographic representation 
into the phonetic transcription. 

8. The device as claimed in claim 6 or 7, characterized 
in that the second and/or third converter unit (13, 17) 
is connected to a memory (19) for storing a 
phoneme /grapheme assignment table. 

9. The device as claimed in one of claims 6 to 8, 
characterized in that the second converter unit (13) is 
connected on the output side to a vocabulary memory 
(5a) of a speech recognition unit (5) . 
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Declaration and Power of Attorney For Patent Application 
Erklarung Fur Patentanmeidungen Mit Vollmacht 

German Language Declaration 



Als nachstehend benannter Erfinder erklare ich hiermit 
an Eides Statt: 



dass mein Wohnsitz, meine Postanschrift, und meine 
StaatsangehSrigkeit den im Nachstehend en nach 
meinenn Namen aufgefQhrten Angaben entsprechen, 



dass ich, nach bestem Wissen der ursprungliche, erste 
und alleinige Erfinder (falls nachstehend nur ein Name 
angegeben ist) oder ein ursprunglicher, erster und 
Miterflnder (falls nachstehend mehrere Namen 
aufgefOhrt sind) des Gegenstandes bin, fur den dieser 
Antrag gestellt wird und fur den ein Patent beantragt 
wlrd fiir die Erflndung mit dem TItel: 

Verfahren und Vorrichtuna zur 



Sprachverarbeitung 



deren Beschreibung 

(zutreffendes ankreuzen) 
□ hier beigefOgt ist 
M am 11.04.2000 als 
PCT internationale Anmeidung 

PCT Anmeldungsnummer 

eingereicht wurde und am . 



PCT/DEOO/01116 



abgeandert wurde (falls tatsachlich abgeandert). 



Ich bestatige hiermit, dass ich den Inhalt der obigen 
Patentanmeldung einschllesslich der AnsprQche 
durchgesehen und verstanden habe, die eventuell 
durch einen Zusatzantrag wie oben erwShnt abgean- 
dert wurde. 



Ich erkenne meine Pflicht zur Offenbarung irgendwel- 
Cher Informationen, die fur die PrQfung der vorliegen- 
den Anmeidung in Einkiang mit Absatz 37, Bundes- 
gesetzbuch, Paragraph 1.56(a) von WIchtigkeit sind, 
an. 



Ich beanspruche hiermit auslandische Prioritatsvorteile 
gemass Abschnitt 35 der Zivilprozessordnung der 
Vereinlgten Staaten, Paragraph 119 aller unten ange- 
gebenen Auslandsanmeldungen fur ein Patent oder 
eine Erfindersurkunde, und habe auch alle Auslands- 
anmeldungen for ein Patent oder eine Erfindersurkun- 
de nachstehend gekennzeichnet, die ein Anmelde- 
datum haben, das vor dem Anmeldedatum der 
Anmeidung liegt, fQr die PrioritSt beansprucht wlrd. 



As a below named inventor, I hereby declare that: 



My residence, post office address and citizenship are 
as stated below next to my name, 



I believe I am the original, first and sole inventor (if only 
one name is listed below) or an original, first and joint 
inventor (if plural names are listed below) of the 
subject matter which is claimed and for which a patent 
is sought on the invention entitled 



Method and device for speech 
processing 



the specification of which 

(check one) 

□ is attached hereto. 

S was filed on 11.04.2000 



as 



PCT international application 

PCT Application No. PCT/DE0Q/01116 

and was amended on 



(if applicable) 



I hereby state that I have reviewed and understand the 
contents of the above identified specification, including 
the claims as amended by any amendment referred to 
above. 



I acknowledge the duty to disclose information which is 
material to the examination of this application in 
accordance with Title 37, Code of Federal Regulations, 
§1. 56(a). 



I hereby claim foreign priority benefits under Title 35, 
United States Code, §119 of any foreign application(s) 
for patent or inventor's certificate listed below and have 
also identified below any foreign application for patent 
or inventor's certificate having a filing date before that 
of the application on which priority Is claimed: 



Form PTO-FB-240 (8-83) 
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Prior foreign appplications 
Prioritat beansprucht 




Priority Claimed 


19931050.5 DE 


06.07.1999 


H □ 


(Nummer) (Land) 


(Day Month Year Filed) Yes No 
(Tag Monat Jahr eingereicht) Ja Nein 


fNumber^ fCountrv^ 
(Nummer) (Land) 


I — 1 i — 1 

□ □ 

(Day Month Year Filed) Yes No 
(Tag Monat Jahr eingereicht) Ja Nein 


^Numbert ?Countrv) 
(Nummer) (Land) 


□ □ 

(Day Month Year Filed) Yes No 
(Tag Monat Jahr eingereicht) Ja Nein 


Ich beanspruche hiermit gemass Absatz 35 der Zivil- 
prozessordnung der Vereinigten Staaten, Paragraph 
120, den Vorzug aller unten aufgefQhrten Anmel- 
dungen und falls der Gegenstand aus jedem Anspruch 
dieser Anmeldung nicht in einer frtiheren 
amenkanischen Patentanmeldung laut dem ersten 
Paragraphen des Absatzes 35 der Zivilprozefiordnung 
der Vereinigten Staaten, Paragraph 122 offenbart ist, 
erkenne ich gemSss Absatz 37, Bundesgesetzbuch, 
Paragraph 1.56(a) meine Pflicht zur Offenbarung von 
Informationen an, die zwischen dem Anmeldedatum 
der frOheren Anmeldung und dem nationalen Oder PCT 
internationalen Anmeldedatum dieser Anmeldung 
bekannt geworden sind. 


1 hereby claim the benefit under Title 35. United States 
Code. §120 of any United States application(s) listed 
below and, insofar as the subject matter of each of the 
claims of this application is not disclosed in the prior 
United States application in the manner provided by 
the first paragraph of Title 35. United States Code, 
§122, i acknowledge the duty to disclose material 
Information as defined in Title 37, Code of Federal 
Regulations, §1. 56(a) which occured between the filing 
date of the prior application and the national or PCT 
international filing date of this application. 


PCT/DEOO/01116 


11.04.2000 


anhanaia pendinq 


(Application Serial No.) 
(Anmeldeseriennunnmer) 


(Filing Date D. M, Y) 
(Anmeldedatum T, M, J) 


(Status) (Status) 
(patentiert, anhSngig, (patented, pending, 
aufgegeben) abandoned) 


(Application Serial No.) 
(Anmeldeseriennummer) 


(Filing Date D.M.Y) 
(Anmeldedatum T, M; J) 


FORMULARTEXTFORMULART 

EXTFORMULARTEXTFORMULARTEXTFORMULARTEXTFORMUL 
ARTEXTFORMUIjWTEXTFORMUL 


Ich erklSre hiermrt, dass aile von mir in der vorliegen- 
den Erklarung gemachten Angaben nach meinem 
besten Wissen und Gewissen der vollen Wahrheit 
entsprechen, und dass Ich diese eidesstattliche Erkla- 
rung in Kenntnis dessen abgebe, dass wissentlich und 
vorsStzlich falsche Angaben gemass Paragraph 1001, 
Absatz 18 der Ziviiprozessordnung der Vereinigten 
Staaten von Amerika mit Geldstrafe beiegt und/oder 
Gefangnis bestraft warden koennen, und dass derartig 
wissentlich und vorsatzlich falsche Angaben die GQI- 
tigkeit der vorliegenden Patentanmeldung oder eines 
darauf erteiften Patentes gefShrden konnen. 


1 hereby declare that all statements made herein of my 
own knowledge are true and that all statements made 
on information and belief are believed to be true, and 
further that these statements were made with the 
knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States 
Code and that such willful false statements may 
jeopardize the validity of the application or any patent 
issued thereon. 
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VERTRETUNGSVOLLMACHT: Als benannter Erfinder 
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Patentanwalte) und/oder Patent-Agenten mit der 
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mit der Abwicklung ader damit verbundenen GeschSfte 
vor dem Patent- und Warenzeichenanfit: (Name und 
Reglstratlonsnummer anfOhren) 



POWER OF ATTORNEY: As a nanned inventor, ! 
hereby appoint the following attorney(s) and/or 
agent(s) to prosecute this application and transact all 
business in the Patent and Trademark Office 
connected therewith. (M name and registration 
number) 



Customer No. 25227 



And I hereby appoint 



Telefongesprache bitte richten an: 
(Name und Telefonnummer) 



Direct Telephone Calls to: (name and telephone 
number) 



Ext. 



Postanschrift: Send Correspondence to: 

Morrison and Foerster LLP 
2000 Pennsylvania Ave., NW 20006-1 888 Washington, DC 
Telephone: (001) 202 887 1500 and Facsimile (001) 202 887 0763 

or 

Customer No. 25227 



Voller Name des einzigen Oder ursprQnglichen Erfinders: 


Full name of sole or first inventor: 

GERHARD NIEDERMAIR 


Urffirs^rift des Erfinders * Datum 


Inventor's signature 


Date 


•Vli^KIRCMEM. DEUTSCHLAND 


Residence 

VIERKIRCHEN, GERMANY 


Staatsangehdrigkeit 

DE 


Citizenship 

DE 


Postanschrift 

LUDWIG-THOMA-STR. 9 


Post Office Addess 

LUDWiG-THOMA-STR. 9 


85256 VIERKIRCHEN 


85256 VIERKIRCHEN 


Voller Name des zwelten Miterfinders (falls zutreffend): 


Full name of second joint inventor, if any: 


Unterschrift des Erfinders Datum 


Second inventor's signature 


I^e 


Wohnsitz 


Residence 
1 


Staatsangehdrigkeit 


Citizenship 


Postanschrift 


Post Oftice Address 







YL 



(Bitte entspreciiende Informationen und Unterac/iriften im 
Fafie von dritten und weiteren M^erfindern angeben). 



(Supply similar information and signature for third and 
subsequent Joint inventors). 
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